A Semantic-based String Matching Library

نویسنده

  • Yue Han
چکیده

String matching has been widely used in more and more web data applications. Although a lot of matching techniques have been proposed, most of them are based on evaluating the similarity of strings in terms of their lexical or syntactic structures. The fact is that the same entity can be denoted by totally different expressions for which the conventional syntactic-based string matching techniques will not be effective any more. In this paper, a semantic-based string matching library (SSML) is proposed and implemented to achieve the semantic level string matching by combining all possible patterns for specific domain and appropriate matching techniques together. Particularly, two widely used domains are analyzed in details and implemented, that is, “Date” and “Person Name”. For each domain, a metric is proposed to quantify the potentiality of string matching. In general, under the framework of SSML, it is convenient and efficient to add new domains into the system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Field Word Segmentation Model Based on Ontology in Digital Library

This paper studied a novel method based on ontology and syntax analysis. According to the disadvantages of the method based on character string matching, the method based on comprehension and the method based on statistic, we not only gave a formal definition of the model and the functions on it, but also gave a semantic query algorithm based on ontology. With the introduction of ontology techn...

متن کامل

A procedure for Web Service Selection Using WS-Policy Semantic Matching

In general, Policy-based approaches play an important role in the management of web services, for instance, in the choice of semantic web service and quality of services (QoS) in particular. The present research work illustrates a procedure for the web service selection among functionality similar web services based on WS-Policy semantic matching. In this study, the procedure of WS-Policy publi...

متن کامل

An Improved Semantic Schema Matching Approach

Schema matching is a critical step in many applications, such as data warehouse loading, Online Analytical Process (OLAP), Data mining, semantic web [2] and schema integration. This task is defined for finding the semantic correspondences between elements of two schemas. Recently, schema matching has found considerable interest in both research and practice. In this paper, we present a new impr...

متن کامل

Distributed Query Processing in Cloud using Canonicalization Approach

Nowadays, citations and its formats act as a vital role in scientific publication digital libraries (DLs). Due to various reasons, the citation metadata extraction process make it difficult. The citation used in the extraction process is gathered from web and there is no standard style to the citations. Data gathered from the web is difficult to process due to the erroneous nature of the data. ...

متن کامل

System for Parallel Heterogeneity Resolution (SPHeRe) results for OAEI 2013

SPHeRe is an ontology matching system that utilizes cloud infrastructure for matching large scale ontologies and focus on alignment representation to be stored in the Mediation Bridge Ontology (MBO). MBO is the mediation ontology that stores all the alignments generated between the matched ontologies and represents it in a manner that provides maximum metadata information. SPHeRe is a new initi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011